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■ Abstract 



c3 



This paper develops robust test procedures for testing the intercept of a simple re- 
gression model when it is apriori suspected that the slope has a specified value. Defining 
unrestricted test (UT), restricted test (RT) and pre-test test (PTT) corresponding to the 
unrestricted (UE), restricted (RE), and preliminary test estimators (PTE) in the estima- 
tion case, the M-estimation methodology is used to formulate the M-tests and derive their 
asymptotic power functions. Analytical and graphical comparisons of the three tests are 
obtained by studying the power functions with respect to size and power of the tests. It is 
' shown that PTT achieves a reasonable dominance over the others asymptotically. 
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o ■ 

O ■ 1 Introduction 

# >: 

In recent years many researchers have contributed to the estimation of one parameter in the 
presence of uncertain prior information on the value of another parameter. In general, inclusion 
of non-sample prior information improves the quality of inference. In spite of plethora of work 
in the area of improved estimation using non-sample prior information (c.f. Saleh, 2006), very 
little attention has been paid on the testing of parameters in the presence of uncertain prior 
information. It may be a normal expectation that testing of one parameter after pre-testing 
on another would improve the performance of the ultimate test in the sense of better power 
and size of the ultimate test. In this paper, this improvement is achieved by using a robust 
test, namely score type M-test defined along the line of M-estimation methodology. 
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If the underlying distribution is known and the assumed model holds, the statistical test 
that offers the most powerful test is the classical likelihood ratio test (LRT). However, this 
parametric test is generally very non-robust, even to small departures from the assumed dis- 
tribution (c.f. Huber, 1981, p. 264, Jureckovd and Sen, 1996, p. 408). Several robust tests are 
suggested in literature to tackle the problem. For example, Huber (1981, p. 264) suggests some 
censored likelihood ratio type test and shows the test possesses a minimax property. The 
test however does not work out conveniently for composite null hypothesis against composite 
alternative testing problem (c.f. Jureckovd and Sen 1996 p. 407). The Rao's efficient score 
statistic could be tempted on the least favorable distribution to obtain a robust test statistic. 
Sen (1982) suggests a score type statistic by replacing the robust Rao score test statistic by 
an M-statistic. The score-type M-test has several advantages. The construction of the score- 
type M-test needs less parameters to estimate than the robust LRT suggested by Schrader 
and Hettmansperger (1980) yet both are equiemcient (c.f. Sen, 1982) and are applicable to 
composite hypothesis testing. 

The properties of unrestricted estimator (UE), restricted estimator (RE) and pre-test es- 
timator (PTE) have been investigated by many authors (Khan and Saleh, 2001, Khan et al., 
2002). Most of the studies are based on normal or t-models and the results are non-robust. 
In the studies, the PTE (a linear combination of UE and RE) possesses a small quadratic risk 
when the distance parameter are large and too close to zero, that makes it the best choice 
over the other two estimators. Instead of least squares (LS) and maximum likelihood (ML) 
estimators, the properties of UE, RE and PTE are also studied in the framework of general 
robust estimators, explicitly, M-estimators. As such, a robust estimator namely the prelimi- 
nary test M-estimator (PTME) are proposed for linear models (Sen and Saleh, 1987). In this 
paper, three tests correspond to the UE, RE and PTE are defined. They are unrestricted test 
(UT), restricted test (RT) and pre-test test (PTT). The study of the properties on these tests 
formulated using robust statistics is unavailable in the statistics literature. 

The properties of size of the pre-test as well as the power of the test followed by pre-test 
have been studied in parametric cases (Bechhofer, 1951, Bozivich et al., 1956). After almost 
three decades, the effect of pre-test (on slope) on the size and power of the ultimate test (on 
the intercept) are investigated for rank-based nonparametric cases by Saleh and Sen (1982). 
However, there are some limited discussions in investigating the power of the PTT discussed 
in the paper. To author's knowledge, no research has been done in the investigation of the 
performance (size and power) of the ultimate test following a pre-test in linear models that 
is formulated using the score-type M-test. Since the M-estimation method is more popular 
compared to the other robust methods, it is incomplete to ignore the study of the performance 
characteristics of the power function after pre-testing based on M-test. M-estimation is known 
for its flexibility and well defined for a variety of models for which MLE is also defined (Huber, 
1981, p. 43, Jureckovd and Sen, 1996 p. 80). In this paper, the study on the power of test after 
pre-testing using M-test is considered for a simple linear regression model. 
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Consider a simple regression model of n observable random variables, JQ, i = 1, . . . , n 

X i = + Pc i + e i , (1.1) 

where the errors e^'s are from an unspecified symmetric and continuous distribution function, 
Fi, i = 1, . . . , n, the q's are known real constants of the explanatory variable and 6* and (3 are 
the unknown intercept and slope parameters respectively. 

We wish to test the significance of the intercept parameter under various conditions on the 
slope parameter. Basically, testing the intercept of a simple linear regression depends on the 
knowledge on the slope. We may have the case when the slope is unspecified. For this case, the 
slope is treated as a nuisance parameter in testing the significance of the intercept and we refer 
the test as the unrestricted test (UT). We may also have the case when the value of the slope 
is specified (say zero) and thus we return to the situation of testing the location parameter. 
The test on the intercept after specifying the value of slope is defined as the restricted test 
(RT). Besides these two cases, if the value of the slope is suspected to be close to (or any 
specified value), a natural action is to remove the uncertainty of the suspicious value of the 
slope by performing a test on the slope before testing the intercept. If the null hypothesis of 
the pre-test is rejected, then we use the UT, otherwise the RT. This is in line with definition 
of the PTE in the estimation problem. For the final case, the ultimate test following a pre-test 
is defined as the pre-test test (PTT). Obviously the preliminary test (on the slope) affects the 
power and size of the ultimate test (on the intercept). To simplify, 

• The unrestricted test : Test function 4>^ T is designed for testing Hq : = when (3 is 
unspecified, 

• The restricted test: The test function cj)^ T is designed for testing H$ : 9 = when (3 is 
(specified) and 

• The pre-test test : The test function 4>n TT is designed for testing Hq : 6 = following a 
pre-test on the slope. 

The objectives of the current research are to propose a robust test statistic based on M- 
statistic to formulate the asymptotic power functions for testing the intercept after pre-testing 
on slope and to carry out investigations on the asymptotic properties of this power function. 

Along with some preliminary notions, the method of M-estimation is presented in Section 
2. In Section 3, three statistical tests concerning testing on the intercept, namely, the UT, RT 
and PTT are proposed for the three different cases mention earlier. Further, the asymptotic 
distributions of the test statistics are derived in Section 4. In Section 5, the asymptotic power 
functions of the tests are derived. Section 6 is devoted to the analytical results comparing 
the asymptotic power functions of the UT, RT and PTT while the investigation of the power 
functions through an illustrative example is presented in Section 7. The final section presents 
discussions and concluding remarks. 



3 



2 The M- estimation 



Given an absolutely continuous function p : 3ft — > 3i, M-estimator of and /3 is denned as the 
values of and j3 that minimize the objective function 

n 

Y^piXi-e-Pa). (2.1) 

i=l 

M-estimator of and /? can also be defined as the solutions of the system of equations, 

SXiMXi) = ELtMXi-e-fc) = o, 
£™=i#PQ) = ^cnHXi-e-p*) = o. 

If p is differentiable with partial derivatives ipg = dp/d9 and tpg = dp/d/3, then the M- 
estimators that minimize the function in (|2.ip are the solutions to the system (I2.2p . On the 
contrary, the M-estimators obtained from solving system (|2.2I) may not minimize equation ()2.ip 
(c.f. Caroll and Rupert, 1988 p. 210). The system of equations (I2.2p may have more roots, 
while only one of them leads to a global minimum of (12. ip . Jureckovd and Sen (1996) have 
given proof that there exists at least one root of (|2.2p which is a y/n - consistent estimator of 
9 and (3 under some conditions [c.f. p. 215 - 224]. The ip function is decomposed into the sum 

Ip = lp a + Ipc + Ips, 

where 

(a) ip a is absolutely continuous function with absolutely continuous derivative. 

(b) ip c is a continuous, piecewise linear function with knots at p,%, . . . , p,/~, that is, constant in 

a neighborhood of ±oo and hence its derivative is a step function ip' c {z) = a v , p v < z < 
p v +i, v = 0, 1, . . . , k where ao, . . . , e K, «o = «fe = an d —oo = po < p\ < . . . < 
Pk+i = oo. We assume that f(z) = dF Jf is bounded in neighborhoods of S^, . . . , S^ k . 

(c) ips is a nondecreasing step function, ip s {z) = X v , q v < z < q v +i,v = l,...,m where 

— oo = qo < qi < . . . < <j[ m +i = oo and — oo < Xq < X\ < . . . < X m < oo. We assume 
that < f(z) = (d/dz)F(z) and f'(z) = (d? /dz 2 )F(z) are bounded in neighborhoods of 
S qi , • • • , Sq m . 

The asymptotic result under conditions Ml to M5 of Jureckovd and Sen (1996, p. 217) is used 
in this paper. Further assume that all ip a , ip c and ip s are nondecreasing and skew symmetric 
that is tpj{—x) = —vpj(x), j = 1,2,3. Let F be symmetric about 0, so that 



ip(x)dF(x) = 0. 



Assume that 



<7q = / tp 2 (x)dF(x). (2.3) 

J — oo 

Following Jureckovd and Sen (1996, p. 217), two cases are considered: 
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(i) if ip s = then 

/oo 
(iP> a (x)+^' c (x))f(x)dx. (2.4) 
-oo 

(ii) if ipa = 4>c = 0, then 

1 = J2(K-K-i)f(S qv ). (2.5) 

Further assume that o"o and 7 are both positive and finite quantities. Let the distribution 
function, F be continuous and symmetric about zero and have finite Fisher information, 

/•oo 

1(f) = / {f'(x)/f(x)} 2 dF(x), (2.6) 



where f'(x) = (d/dx)f(x) = (d 2 / dx 2 )F(x). Assume that 
(i) there exists finite constants c and C*(> 0) such that 

lim c n = c and lim n~ x Cf = C* 2 (2.7) 

with 

n n 

Y,Ci and Cf = j2c 2 -nc 2 n (2.8) 



n- 1 



i=i i=i 

both exist. 



(ii) the Cj's are all bounded, so that by (i), 

max (cj - Cn) 2 /C* 2 -> 0, as n -> 00. (2.9) 

l<i<n 

Let V : ^ ~^ ^ be nondecreasing and skew symmetric score function. For any real numbers a 
and b, consider the statistics below 

n 

M ni {a,b)=Y,^{X l -a-bc l ), (2.10) 
i=i 

n 

M n2 (o,6) = 5^c i ^(X i -a-6c i ). (2.11) 
i=i 

Let /3 be the constrained M-estimator of (3 when 9 = 0, that is, /3 is the solution of M n2 (0, b) = 
and it may be conveniently be expressed as 

= [sup{6 : M n2 (0,6) > 0} + inf{6 : M„ 2 (0,6) < 0}]/2. (2.12) 

V v ' V v ' 

bi b-2 

Any value b\ < b < 62 can serve as the estimate of M n (0, b). However (3, the centroid (and 
the median) of the interval [61, 62] achieves the smallest maximum bias among all translation 
invariant functionals (Huber, 1981 p. 75), hence it is a robust estimator with optimum robust- 
ness properties. Similarly, let 6 be the constrained M-estimator of 9 when (3 = 0, that is, 9 is 
the solution of M m (a, 0) = and conveniently be expressed as 

9 = [sup{a : M m (a, 0) > 0} + inf{a : M m (a, 0) < 0}]/2. (2.13) 
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By the same argument as above, 9 is a robust estimator. 

The preliminaries notations and assumptions in this section are used to develop the tests 
and construct the asymptotic power functions of the UT, RT and PTT. In the next section, the 
UT and RT are proposed. The PT (testing on slope) is introduced and the PTT is constructed. 



3 The UT, RT and PTT 

Sen (1982) shows that the asymptotic distribution of 

n-V 2 M n2 (0, 0) - N(0, alC* 2 ) (3.1) 

under : (3 = 0. The consistency of [S^] 2 = n^ 1 Y17=i V ;2 (^i — as an estimator of cr 2 
follows from Jureckovd and Sen (1981). Hence, a test statistic A n = M n2 {9, 0)[C* sl 3 ^] -1 is 
proposed by Sen (1982). The advantage of this test statistic (score-type M-test) is that it does 
not require the computation of the M-estimates or the estimation of functional 7. 
By the same way, it is easy to show that the asymptotic distribution of 

n-V 2 M ni (0, p) - N(0, olC* 2 /{C* 2 + c 2 }) (3.2) 

under Hq : 8 = 0. By the same token, the consistency of [si 1 "*] 2 = n~ x Y17=i ^(Xi — as 
an estimator of Oq follows. 

These two asymptotic distributions results are useful to construct suitable test in formulat- 
ing the asymptotic power function for testing the intercept after pre-testing. We are primarily 
concerned with statistical tests for the parameter 9 as well as (5. In essence we need to consider 
four test functions correspond to the four proposed tests. 

3.1 The unrestricted test (UT) 

If (3 is unspecified, the designated test function is eff^ with the null hypothesis Hq : 9 = 0. The 
testing for 6 involves the elimination of the nuisance parameter (5. It follows that M n2 (0, b) 
is decreasing if b is increasing (Jureckovd and Sen, 1996, p. 85) and under local hypothesis, 
: (3 = 0, M n2 (0, 0) has expectation 0. Then let 

P = (sup{6 : M n2 (0, b) > 0} + inf {b : M„ 2 (0, b) < 0})/2. 

Then (3 is a translation invariant and robust estimator of [3. 

We consider the test statistic T^ T = M ni (0,/3) where under Hq, asn^oo, 

rpUT 

N(0,1) 



r (l)o(l) 2 



with CP =n- n 2 c 2 J Zc 2 = nCf/(Cf + nc 2 n ) and [s£ ] ] 2 = Z ^{^ - P^/n. 
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3.2 The restricted test (RT) 

If f3 = 0, the designated test function is 4>^ T for testing the null hypothesis Hq : 9 = 0. The 
proposed test statistic is T^ T = M ni (0, 0). Note that for large n, under Hq : 9 = 0, j3 = 0, 

n -i/2 T RT = n -i/2 Mni(0;0) ^ N(0,a 2 ), (3.3) 
where ct 2 = J^ oo ^ 2 {x)dF{x) (see Sen, 1982, eq 3.7). 



3.3 The pre-test test (PTT) 

In this section, test on slope is proposed first and followed by the construction of the ultimate 
test for testing intercept. 



For the preliminary test on the slope, the test function, 



lPT 



is designed to test the null 



hypothesis Hq : = 0. The proposed test statistic is T^ T = M n2 (9, 0) where 

9 = (sup{a : M m (a, 0) > 0} + inf {a : M ni (a, 0) < 0})/2 
is a robust estimator. Under H^\ asn-> oo, 



r (3) q&Y 



iV(0,l), 



where c£ 3) = nc 2 n = C* 2 and [S { ^] 2 = £ ^ 2 ( Xl - 9)/n. 

The consistency of [Sn^] 2 , [Sn^] 2 = ^2"ip 2 (x)/n and [sl 3 ^] 2 as estimators of Oq follows by 
law of large number (Jureckovd and Sen, 1981). 

Now, we are in a position to formulate a test function 4>!^ TT to test Hq : 9 = following 

(i) 

a preliminary test on p. First, we consider the case where all of q>n , j = 1, 2, 3 are one-sided 
test. Let us choose positive numbers ccj (0 < ay < 1) and real values £n}aj, j = 1)2,3, such 
that for large sample size, 

P[TV T >£% T ai \H*:9 = 0]= ai , (3.4) 
P[T* T > \H :9 = 1 (3 = 0]=a 2 , (3.5) 
P[T? T ylZjH^ :/3 = 0]=a 3 , (3.6) 

where £njoy i s the critical value of Tn at the ay level of significance. Let $(x) be the standard 
normal cumulative distribution function, then 



<J>(r a ) = 1 - a, for < a < 1. 
Using equations (13. 3D . (|3.5p and (13.7|) . we obtain 
l-aa = P[7T<0 



(3.7) 



P 



P 



n 



1 / 2 r r f T /V^ )2 <-- 1/2 e 2 /V^ )2 



n 



-1/2 T RT I 



± n /V u 



<tT 1/2 C/ 



*("- 1/a C/^o), 
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where sffi = ^2'4'' 2 ( x i)/ n - Thus as n — » oo we have 



„-l/2 fRT n -l/2pKT 

' r a2 = =2— (say). (3.8) 



c(2) 2 V CT 6 
By the same way, using (|3.2j) and (|3.4|) . we observe that as n —* oo, 

(say), (3.9) 



n -l/2 fUT n -l/2fUT 



where = X] V ;2 ( a; j ~~ f3~Ci)/n, and ci 1 "* = re — ra 2 c 2 / ^ c 2 . Also by (|3.ip and (|3.6p . as n — > oo, 

-1/2/PT -1/2/PT 

? 2 "' tt3 - ^ = (say), (3.10) 

where si 3 " 1 = X] ^ 2 (^i — and C* 2 = ^2cf — nc^. 

Now we may write 

as the test function for testing .ffg : = after a pre-test on /3. Note that /(.A) stands for the 
indicator function of the set A. It takes value 1 if A occurs, otherwise 0. The function enables 
us to define the power of the test <^ TT , that is given by 

K TT {0) = E{d> p J T \e) 

= P[T^ T <iZs^ n RT >iZM+nT^ T >iZ^r>^\0]. (3.12) 

In general, the power of the test 4>n TT depends on ax, 012, 0:3, 6, n as well as (5. Note that the 
size of the ultimate test special case of the power of the test when 9 = 0. Since the 

nuisance parameter (3 is unknown, but, suspected to be close to 0, it is of interest to study the 
dependence of both a PTT and H PTT (9) on (5 (close to 0). 



4 Asymptotic distribution of Z^ T , T^ T and T 7 



PT 
n 



This section is devoted to the asymptotic distribution theory of statistics involved in proposing 
the PTT. The asymptotic joint distributions of [T^ /T ,T r f T ] and [T^ T ,T pt ~\ are derived. The 
results are used in the next section in the construction of the power function of the UT, RT 
and PTT. 

Let {K n } be a sequence of alternative hypotheses, where 

K n : (9,(3) = (n- 1 / 2 Ai,n- 1 / 2 A 2 ), (4.1) 

with Ai, A2 are (fixed) real numbers. 

Interested readers are referred to Jureckovd (1977), Sen (1982) and Jureckovd and Sen 
(1996, p. 221) for the following asymptotic properties: 
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(i) under Hq : 9 = 0, (3 = 0, as n grows large, 



n 



-1/2 



M ni (0,0) 
M„ 2 (0,0) 



No 








1 c 

~<*2 , -2 



(4.2) 



(4.3) 



c C* z + c 

where A^- , • ) represents a bivariate normal distribution with appropriate parameters. 

(ii) under H : 9 = 0, (3 = 0, 

sup{n- 1 / 2 |M„ 1 (a,6) - M ni (0, 0) + n 7 (a + 6c) | : 
|a| < n-V2x, |b| < n-V2 K } ^ p 
as n — > oo and K is a positive constant. 

(iii) under H : 9 = 0, /3 = 0, 

sup{n- 1 /2| Mn2 ( a) & ) _ Mn2 ( , o) + n 7 {ac + b(C* 2 + c 2 )}| : 

|a| < n-V 2 K, \b\ < n'^K} 



(4.4) 



as n — > oo and is a positive constant. The above convergence is in probability, means 
the sequences of random variables converges in probability to a fix value (0). 

An important concept that dominates the asymptotic theory of statistics is the contiguity 
of probability measures (Jureckovd and Sen, 1996, p. 61). Contiguity arguments are a technique 
to obtain the limit distribution of a sequence of statistics under the alternative hypothesis from 
a limiting distribution under the null hypothesis (c.f. van der Vaart, 1998 p. 85). Let {P n } and 
{Qn} be two sequence of probability measures defined in a measure spaces (fi n , B n , fj, n ). In the 
LeCam's first lemma (Hdjek et al., 1999, p. 251), if 

logL n £ N(~ a 2 , a 2 ) (under {P n }), 

then {Q n } is contiguous to {P n }- Here the likelihood ratio statistic L n is given by 



q n /p n for p n > 
1 for p n = q n = 
oo for = p n < q n , 



where {p n ,Qn} are the sequence of simple hypothesis densities. In the LeCam's third lemma 
(Hdjek et al., 1999, p.257), if 



D 



N 2 



Mi 

M2 



logL„ 

where T n is a statistic with ^2 = — | CJ 22 ) then 



en C12 
C21 C22 



(under {P n }), 



T n ^ AT(/xi + (7i2, cm) (under {Q„}). 



9 



The LeCam's second lemma (Hdjek et al., 1999, p. 253) gives conditions when logL n 
1 

2 



D 



N{-\a 2 ,a 2 ). 



The concept of contiguity is more popular in R-estimation (rank statistic) than in M- 
estimation. However, Sen (1982) uses the contiguity of probability measures under H n : (3 = 
n _1 / 2 A to those under H' Q : (3 = to find the asymptotic distribution of n -1 / 2 [M ni (9, 0), M n2 (9, 0)] 
under H n . In this paper, the contiguity concept is utilized to find the asymptotic distributions 
of statistics n-V2[ T n RT , T„ PT ] and rT x l 2 [ T^ t , T? t ] under K n . 

4.1 Asymptotic distribution of n~ x l 2 T^ T and n~ x l 2 T^ T 

Following Jureckovd and Sen (1996, p. 259), let {P n } and {Q n } denote the probability distri- 
butions with the densities p n = YliLi fi-^i) an d Qn = Yli=i f(^i ~ n~ l l 2 \i — tc n i) of the null 
hypothesis Hq and the alternative hypothesis K n , respectively, where c n i = (q — c n )/C*, i = 
1, . . . , n. Note that under (jl.ip . (|2.7p . (|2,9p and (|4.1|) . the contiguity of the sequence of proba- 
bility measures under {K n } to those under Hq follows from LeCam's first and second lemmas 
(Hdjek et al., 1999, Chapter 7). We are interested in the asymptotic distribution of the joint 
statistics [rrV 2 T* T ,TrV 2 T£ T \ . Here convergence of [n' l / 2 T^ T , n~ 1 l 2 T^ T ] +T -» [0,0] un- 
der H implies [n" 1 / 2 !^, n" 1 / 2 !^ 7 ] + T -> [0,0] under {K n } since the probability measures 
under {-ftT n } is contiguous to that of under Hq (c.f. Saleh, 2006, p. 44). Here, T is a known 
vector. 

Under H Q : 9 = 0, (3 = 0, with relation to flO} and (jO|) . 



n-V 2 M ni (0,0) 
n-^M^^O) 



n-V2M ni (0,0) 
n-^M^ (0,0) 










" " 


+ 


n 1//2 7^c 








or equivalently 



and 



^ 1/2 M m (0,0) = n -^ 2 M ni (0,0) +Op(l) 



n-^M^^O) =n- 1 / 2 M n2 (0,0) -n 1 / 2 7 ^c + o p (l). 
Note also that under i?o : = 0, ,5 = 0, 

n- x / 2 M ni (0, 0) = n- x / 2 M ni (0, 0) - n 1 / V~ + o p (l). 

Recalling definition (I2.13p . the equation (14. 8|) reduces to 

n- 1 / 2 Af ni (0,0) = n^V + o^l), 

and hence equation (j4.7l) becomes 

rT^M^ (0, 0) = n-^Mn, (0, 0) - n^M^ (0, 0)c + o p (l). 

Therefore, under Hq, equation (|4.5p becomes 



(4.5) 

(4.6) 
(4.7) 

(4.8) 
(4.9) 
(4.10) 
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n-V2M ni (0,0) 
n-V2M n2 (^0) 



n-V2M ni (0,0) 
n-V2M n2 (0, 0) - n- l ' 2 M ni (0, 0)c 



n-V2M ni (0,0) ' 




1 " 




" n-V2M ni (0,0) " 




" " 






-c 1 




n-^M^ (0,0) 








(4.11) 



Now utilizing the contiguity of probability measures under {K n } to those under Hq, the equa- 
tion (|4.11|) implies that 

" n- 1 /2Af ni (0,0) 
under {K n } is asymptotically equivalent to the random vector 



1 

-c 1 



n-V2M ni (0,0) 
n-V2M n2 (0,0) 



under Hq. But the asymptotic distribution of the above random vector under {K n } is the same 
as 



1 
-c 1 



n-V2 Mni (_ n -i/2 Ai 
n-V2M n2 (_ n -i/2 Al 



-n-V2 A2 ) 
-n-V2 A2 ) 



under Hq by the fact that the distribution of M ni (a, b) under 6 = a, (3 = b is the same as that 
of M ni (6 -a,/3-b) under = 0,0 = 0, and similarly for Af n2 (0,0) (c.f. Saleh, 2006 p.332). 



Note that under H { 



o 



0,0 = 0, with relation to flO} and 



n 



- 1 / 2 M ni {-n- 1 / 2 X 1 ,-n- 1 / 2 X 2 ) 
- 1 /2 Mn2 (-n- 1 /2 Al ,-n- 1 /2A 2 ) 



n-^M^^O) 
n- 1/2 M„ 2 (0,0) 



(4.12) 



7(Ai + A 2 c) 




" " 


7{A 1 c + A 2 (C* 2 + c 2 )} 








Hence, by equation (|4.2|) . under H[ 



n- 1 /2 Mni (-n- 1 /2A 1 , 



-n-V 2 \ 2 ) 
-n-V2 A2 ) 



N 2 



7(Ai + A 2 c) 
7 {A 1 c + A 2 (C* 2 + c 2 )} 



4 



c 

C* 2 + . 



(4.13) 



Thus, the distribution of 



" n" 1 /2 T fiT " 









n-V2M ni (0,0) 
n-^M^^O) 



under {/C n } is bivariate normal with mean vector 



1 
-c 1 



7(Ai + A 2 c) 
7 {A 1 c + A 2 (C* 2 + c 2 )} 



7(Ai + A 2 c) 
7A 2 C* 2 
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and covariance matrix 



1 

-c 1 



1 c ) 


1 " 




' 1 





c C* 2 + c 2 ) 


-c 1 







c* 2 



(4.14) 



Since the two statistics n l l 2 T^ T and n l / 2 T^ T are uncorrelated, asymptotically, they are 
independently distributed normal variables. 

4.2 Asymptotic distribution of n~ l ^T^ T and n~ l / 2 T^ T 
Under Hq : 9 = 0, {3 = 0, with relation to (14.30 and (14.40 . as n — > oo, 



n- 1/2 M ni (0^) " 




" n-V2M ni (0,0) " 








' " 






n-^M^ (0,0) 




n 1//2 7#c 








or equivalently 



and 



n- 1 / 2 M m (0, 0) = rT x l 2 M nx (0, 0) - n 1 / 2 7 /3c + o p (l) 



rr^MfJM) = n" 1/2 M n2 (0,0) - n 1 /^ + o p (l). 
Again under i?o : # = 0, (3 = 0, 

r^M^O,/?) = n~ l / 2 M n2 (0,0) - n 1 / 2 7 /3(C* 2 + c 2 ) + o p (l). 

But by definition (12,120 . we have 

n-^Mn, (0, 0) = n 1 ' 2 ^* 2 + c 2 ) + o p (l), 

so equation (14,160 becomes 

n~ x l 2 M n , (0, j3) = rr l / 2 M m (0, 0) - n'^M^ (0, 0)c/(C* 2 + c 2 ) + o p (l). 

Hence, under Hq, using equations (14.100 and (|4.20j) . equation (14.150 becomes 



(4.15) 

(4.16) 
(4.17) 

(4.18) 
(4.19) 
(4.20) 



n-V 2 M ni (0j) 



n-V2M ni (0,0) - n-V2M n2 (0,0)c/(C* 2 + 
n-^M^ (0, 0) - n-^M^ (0, 0)c 



n-V2M ni (0j) ' 




1 -c/(C* 2 + c 2 ) " 




" n-V2M ni (0,0) ' 




" " 


n-^M^^O) 




-c 1 




n-V2M na (0,0) 








• (4-21) 



Now by using the contiguity of probability measures under {K n } to those under Hq, the 
equation (14.210 implies that 
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under {K n } is asymptotically equivalent to the random vector 



1 -c/(C* 2 + c 2 ) 
-c 1 



n-V2M ni (0,0) 
n~ x l 2 M n% (0,0) 



But the asymptotic distribution of the above random vector under {K n } is the same as 



1 -c/(C* 2 + c 2 ) 
-c 1 



n- 1 / 2 M n2 {-n- 1 / 2 X 1 ,-n-y 2 X 2 ) 



under Hq by the fact that the distribution of M m (a, b) under 8 = a, [5 = b is the same as that 
of M ni (6 -a,(3-b) under 9 = 0,(3 = and similarly for M n2 (0,0) (c.f. Saleh, 2006 p.332). 
Then it follows that by equation (|4.13|) . 



n~ 



L n 

-l/2rpPT 



- n 



n-V 2 M n2 (e,0) 



is bivariate normal with mean vector 



1 -c/(C* 2 + c 2 ) 
-c 1 

and covariance matrix 

1 -c/(C* 2 + c 2 ) 
-c 1 



7(Ai + A 2 c) 
1 {X 1 c + X 2 {C* 2 + c 2 )} 



7A!C* 2 /(C* 2 
7A2C* 



c C* 2 + c 2 



1 -c/(C* 2 + c 2 ) 
-c 1 



*2 



C**H<c* 2 + c 2 ) -c C* 2 /(C* 2 + c 2 ) 
-c C* 2 /(C* 2 + c 2 ) C* 2 



(4.22) 



Clearly, the two test statistics n 1 / 2 T^ T and n l / 2 T PT are not independent, rather correlated. 



5 Asymptotic properties for UT, RT and PTT 



In this section, the asymptotic power functions of (ff l T , <ftn T an d 4> PTT are derived by using 
the results obtained in the previous sections. 

Under {K n }, the power function of 4> PTT is given by 

U PTT (X 1 ,X 2 ) = E^ PTT \K n ) = P[T n PT < l PT a3 ,T* T > ^jKn] 



+ P[T n PT > i P n T as ^ T > t U n T ai \K n ]. (5.1) 
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Note that 



P\T PT < £ PT T RT > £ RT \K 1 

n -i/2 T PT _ lX2C *2 n -i/X T a3 - 7 A 2 C 



P 



*2 



< 



Sn ^ Cn 2 / n 



n 



-i/2 T RT _ 7(Ai + ^ n-V 2 ^ 2 - 7 (Ax + A 2 c) 



o(2) 2 



o(2) 2 



P 



n 



a 2 C* 2 



n -l/2 T HT _ 7(Ai + Aa5) ^ „ 



-1 



/2 ^L"7(Ai + A 2 c) 



as n -> 00 (5-2) 



(2) 2 f'A) 2 9 o 

because the limit of /Sn and Sn are <7q and C* jn —>■ C* as n — > 00. 



From equations (13, 7jl . (j3.8l) and (|3.10|) and (j4.14jl . the probability statement in (15. 2p be- 
comes 

$(r a3 - 7A 2 C*/ao)[l - $(r Q2 - 7 (Ai + X 2 c)/a )]. 
Note that T RT and T PT are independent by equation (|4.14p . 

Define d{q\ ,q 2 '■ p) to be the bivariate normal probability integral for random variables x and 
V, 



d(qi,q2;p) 



1 



f°° r \ -{x 2 + y 2 -2pxy) \ 
J J exp| 2(1 _ p2) \dxdy, (5.3) 



27r(l-p 2 )V2„ ;| j<u 

where qi,q 2 are re& l numbers and — 1 < p < 1. Here d(q±,q 2 )p) is the complement of the 
cumulative density function of standard bivariate normal variable. 

Since S^ and both converge to a 2 , and C n 1 ^ /n — > C* 2 /(C* 2 + c 2 ) as n — > 00, we 

observe that 

P\T PT > i PT T UT >f T \K ] 

V 2 T PT - 7 A 2 C* 2 n-VH P L ~ l^ 2 



P 



n 



Sn 3) Cf/n 



> 



Sn ^ C^ 2 jn 



n -i/2 T UT _ lXl C* 2 /(C* 2 + C 2 ) n-V 2 ^ _ tAiC *2 /(c * 2 + - 2) 



p 



n-^ 2 T PT -j_\ 2 C* 2 > n^ 2 C 3 " - V - r 



*2 



a 2 C* 2 



n 



-i/2 T aT _ ^x xC **/(C* 2 + c 2 ) > W 1 / 2 ^ - 7 AiC* 2 /(C* 2 + c 2 ) 



f o$C* 2 /(C* 2 + c 2 ) 
asn-> 00. Further, the equation (|5.4p is written as 



a 2 C* 2 /(C* 2 + (?) 



(5.4) 



d(r a3 - jX 2 C*/a ,T ai - 7A1 v C* 2 / (C* 2 + c 2 )/a ; -c/\/C* 2 + c 2 



14 



by using equations (pT3j) . (13TTU1) . (|Q2j) and (H3D. Note that T^ T and T,f T are not independent 
because of (|Q2|) . 

Hence, the power function of <^ TT for the PTT becomes 



nr r (Ai, A 2 ) = E{<p p J T \K n ) - n m (A b A 2 ) 

= $(r Q3 - 7A 2 C*/(7o)[l - $(t Q2 - 7 (Ai + A 2 c)/a )] + 



d(r a3 -jX 2 C*/a ,T ai - jX^C* 2 /(C* 2 + c 2 )/a ; -c/VC* 2 + c 2 ). (5.5) 
Similarly, the power function of </>^ T for the RT is given by 

nf , (A 1 ,A 2 ) 



£(<^ T |K„.) 



P 



P 



-V 2 T«r - 7 (A! + A 2 c) . n" 1 /^ - 7 (Ai + A 2 c) 



> 



(2)' 



5, 



(2)' 



n 



^Jf - 7(Ai + A 2 c) ^ n-V 2 ^ 2 - 7 (Ai + A 2 c) 



(5.6) 



since Sn^ — > cr 2 ,. Combining equations (|3.7|) and (|3.8|) . the power function of <^ T becomes 

n RT (Ai, A 2 ) = 1 - $(r a2 - 7 (Ai + A 2 c)/a ). (5.7) 
Finally, the power function of 4>^ T for the UT is obtained as 
n^ T (A 1; A 2 ) = E(^ T \K n ) = P[T^ T >eH T ai \K n ] 
= P 



n 



l l 2 TH T - ^^/(C* 2 + c 2 ) n" 1 / 2 ^ - 7 A!^ 2 /(^ 2 + . 




since Sn } -> o-g. Further the power function for the UT is written as 

n^Ai, A 2 ) = 1 - $(r Q1 - 7 xJc* 2 /(C* 2 + c 2 ) /<7o) 



(5.9) 



using equations (j3.7[) and (13.91) . 

The asymptotic power functions for the UT, RT and PTT that are derived using M-test 
in this section are found to have the same form as that derived by using the rank statistic by 
Saleh and Sen (1982) though the methodology of M-estimation and R-estimation is different. 
Therefore, the investigation on the properties of the power of the M-test is similar to the power 
of the test based on rank statistic. 
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6 Asymptotic comparison 

This section gives analytic asymptotic comparison of the power functions of the UT, RT and 
PTT. 

If we consider c = in equation (|5.5|) , 

n PTT (Ai, A 2 ) = $(r a3 - 1 X 2 C*/a )[l - $(r Q2 - y\i/<r )] + 

[1 - d>(r Q3 - 7 A 2 C*At )][1 - *(r ai - 7 Ai/<r )]- (6.1) 

Letting a\ = a 2 = a and from equations (|5.7p . (|5.9p and (|6.1|) . we observe that the power 
functions for the UT, RT and PTT are the same, i.e. 

n c/T (A 1 , A 2 ) = LT pp (Ai, A 2 ) = n PTT (A l5 A 2 ) = 1 - <J>(r Q - 7 (Ai + X 2 c)/a ). (6.2) 

From equations (|5.5p and (j5.7|) . 

n^(A 1 ,A 2 )-n^(A 1 ,A 2 ) 

= 1 - $(r a2 - 7 (Ai + X 2 c)/a ) - $(r Q3 - jX 2 C*/a )[l - $(t Q2 - 7 (A X + X 2 c)/a )} 

- d(r a3 - jX 2 C*/a ,r m - jX iy /c* 2 /(C* 2 + c 2 )/a ; -c/Vc* 2 + c 2 ) 
= d(r a3 - 7 A 2 C*/Vo, T a% - 7 (Ai + X 2 c) / Gq] 0) 



d(r Q3 -"/X 2 C*/a ,T ai -^JC* 2 /^ + <?)/a ;-c/VC* 2 + <? ). (6.3) 



Letting a\ = a 2 = a, c > 0, A 2 > and Ai + A 2 c > Ai J C* 2 /(C* 2 + c 2 ), 
Result (i): n RT (Ai, A 2 ) > n PTT (Ai, A 2 ) from equation §3$ and 
Result (ii): n fiT (Ai, A 2 ) > n c/T (Ai,A 2 ) from equations <^7^ and ([Oil . 



On the contrary, taking ot\ = a 2 = a, c < 0, A 2 > and Ai + A 2 c < Ai y C* 2 / (C* 2 + c 2 ) 
fiesuft (hi): n PT (Ai,A 2 ) < n PTT (Ai,A 2 ) from equation (JO]) and 
Result (iv): n RT (Ai, A 2 ) < n l/T (Ai, A 2 ) from equations ([521) and 



From equations (|5.7p and (|5.9p . when Ai = A 2 = and «i = a 2 = a, we find n fiT = 
= a. Failure to satisfy the conditions does not means Result (i) and Result (hi) could 
not be obtained. But if Ai = 0, these conditions are always met. Hence, under Hq : = 0, 
a RT > a PTT and ot RT > a UT = a when c > and A 2 > 0. Letting ct\ = a 2 = a, we write 



n c/T (A 1 , A 2 ) - n PTT (A l5 A 2 ) = A + B, 



where A = [1 - $(r a - jX^ C**/{C* A + c 2 )/a )] - [1 - $(r Q - 7 (Ai + A 2 c)/<7 )] and B = 
d(r a3 - jX 2 C*/a ,T a - 7 (Ai + X 2 c)/a ; 0) - d(r Q3 - jX 2 C*/a , r a - -fX^C* 2 /(C* 2 + c 2 )/a ; 
-c/VC* 2 + c 2 ). For c > 0, then Ai+A 2 c > Xi^C* 2 /{C* 2 + c 2 ) and Ta-jX^ C* 2 /{C* 2 + c 2 ) > 
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T a - 7(Ai + A 2 c)/cr . Thus, A = [1 — $ 2 ] — [1 - $1] < because <3?i < $ 2 where 3>i 



^(^a - 7(Ai + A 2 c)/o- ) and $ 2 = $(r a - 7A1 y C* 2 /(C* 2 + c 2 )). We observe three cases 

U UT (X 1 ,X 2 )-U PTT (X 1 ,X 2 )^0 if £ = L4|, 

In a special case, Ai = = A 2 , A = and B > 0, thus, n l/T (0, 0) > n PTT (0, 0). 

When c > and A 2 > 0, the asymptotic size of the RT is larger than both UT and PTT. 
For c > and Ai = 0, the size of the PTT may also be smaller than that of UT (when A 2 is 
small). Similarly, for c < 0, a RT < a and a RT < a PTT while a FTT is more closer to a. 

Refer to equation (|5.5j) . as «3 — ► and r a3 — 7A 2 C*/<7o — > oo, <&(r a3 — 7A 2 C*/o"o) — > 1 



and ^(rQ.g — 7A 2 C*/cro, r ai — 7A1 y C* 2 / (C* 2 + c 2 )/ao; — c/ \/ C* 2 + c 2 ) — > because one of the 
lower limits is approaching infinity. Thus, we observe that 

n PTT (A l5 A 2 ) -»■ 1 - $(r Q2 - 7 (Ai + A 2 c)/ao) = n PT (Ai, A 2 ) as q 3 -» 0. (6.4) 

Whereas as 03 — ► 1 and Tq, 3 — 7A 2 C*/<7o — » —00, $(r a3 — 7A 2 C*/cro) — > and 



d(r Q3 - -yX 2 C*/a ,T ai - 7A1 VC* 2 /(C* 2 + c 2 )/a ; -c/VC* 2 + c 2 ) 



-» 1 - Hr ai - 7 A 1A /C* 2 /(C* 2 + c 2 )Ax ) 
because one of the lower limits is approaching negative infinity. Thus, we observe that 



n PTT (A!,A 2 ) -» 1 -*(r ai --yX iy /c* 2 /(C* 2 + c 2 )/a ) = U UT (X 1 ,X 2 ) as a 3 -» 1. (6.5) 

The analytical results in this section is accompanied with an illustrative example in investi- 
gating the comparison of the power of the tests discussed in the next section. The power of 
the tests at any value other than 8 = is also considered in the example to study the behavior 
of the power functions corresponds to the probabilities of type I and type II errors. Moreover, 
the study of relationship between the level of significance for the PTT and the nominal size of 
the PT as well as the nominal sizes of the UT and RT are explored. 

7 Illustrative Example - Power Comparison 

The asymptotic power functions for the UT, RT and PTT are compared in this section. Under 
{K n }, we note 

(i) n^ T (Ai,A 2 ) as the asymptotic power function for testing Hq : 9 = when (5 is assumed 

to be undefined in the construction of the test statistic T^ T , 

(ii) II PT (Ai,A 2 ) as the asymptotic power function for testing Hq : 8 = when j3 is assumed 
to be zero in the construction of the test statistic T RT and 

(iii) n PTT (Ai,A 2 ) as the asymptotic power function for testing #q : 9 = after pre-testing 

: = 0. 
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For this illustrative example, the random errors of the simple linear model are generated 
from Normal distribution with mean and variance 1. The sample size is n = 1000. Three 
sets of values: and 1 with 50% for each for the first set, —1 and with 50% for each for the 
second set and —1 and 1 with 50% for each for the third set are considered as the values of 
the regressor a, i = 1, 2, . . . , 1000. These values guarantee c > 0, c = and c < respectively 
to the sets of regressors. 

In this example, the ip function is taken as Huber ip function (Hoaglin et al., 1983, p. 366, 
Wilcox, 2005, p. 77), is defined as 



—k if Ui < —k 
Ui if | Ui | < k 
k if m > k, 



where Uj = Xi — 6 — (3ci. As suggested in many reference books (Wilcox, 2005, p. 76), the value 
of k = 1.28 is chosen because k = 1.28 is the 0.9 quantile of a standard normal distribution, 
there is a 0.8 probability that a randomly sampled observations will have a value between —k 
and k (Wilcox, 2005, p. 76). The estimate for do is taken to be ^^(u) 2 /n. The estimate for 7 
is ^^'(uj/n (Caroll and Rupert, 1988, p. 212) where 



V>'(u) 



if u < -1.28 

1 if |u| < 1.28 
if u > 1.28. 



The U PTT , U RT and U UT are calculated using the formulas given by equations (15. 5p . (15. 7p 
and (|5,9p . The R-package (mvtnorm) is used in computing the bivariate Normal probability 
integral. 

In Figure 1, the power functions for the UT, RT and PTT are plotted against A2 at two 
values of Ai. Here Ai = is chosen to study the asymptotic sizes of the tests and we desire 
the size of a particular test to be small so that the probability of type I error is small. Since 
we also wish to get small value of probability of type II error, the power of the test at Ai = 2 
is considered. An acceptable power function of the test is the one that is small when the null 
hypothesis is true but large when Ai differs much from 8 = 0. The first set of regressors is 
used to plot Figures 1(a) and 1(b). As A2 grows larger, IL RT (0, A2) approaches 1. However, 
H PTT (0, X2), after an initial increase, drops and converges to the nominal size a = 0.05 as A2 
grows larger. Thus, the asymptotic size (with very small Ai) of 4> PTT is close to a for small 
A2 and large A2, while for moderate values of A2 it is somehow larger than a but lesser than 
that of H RT (0, A2). The TL UT (0, A2) is constant and does not depend on Ag- The same pattern 
occurs in Figure 1(b) but the power functions are always significantly larger than a, in this 
case larger than 0.4. If one only considers the size of the test, the PTT is preferred to RT, 
though the UT remains as the best choice. However, the RT is the best choice but the PTT 
is preferred to UT if the power of the test at Ai = 2 is considered. It is impossible to obtain 
a test that uniformly minimizes the size and maximizes the power at the same time. We are 
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Figure 1: Graphs of power functions as a function of A2 for selected values of Ai and ol\ = = 
«3 = a = 0.05. Dotted line, solid line and line with star represent H UT (Xi, X2), H RT (Ai, A2) 
and n PTT (Ai, A2) respectively. Graphs (a) and (b) are for c > 0, (c) and (d) are for c = and 
(e) and (f) are for c < 0. 
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looking for a test that is a compromise between minimizing the size and maximizing the power 
(small probabilities of type I and type II errors). The RT is the best choice for its largest 
power but the worst choice for its largest size as A2 grows larger. On the contrary, the UT 
is the best choice for its smallest size but the worst choice for its smallest power. Both RT 
and UT uniformly minimize or maximize the size and power at the same time. The PTT has 
larger power than the UT for small and moderate values of A2 and it has significantly smaller 
size than that of the RT for moderate and large A2. Therefore, if our objective is to obtain a 
test that has better probabilities for both type I and type II errors, the PTT is suggested as 
the best option. The PTT is a compromise between minimizing the size and maximizing the 
power than the RT and UT. 

The cases for c = and c < are also considered in this paper, though c > is more 
important than the other two because it is more realistic. Setting c = in Figures 1(c) 
and 1(d) imply all power functions remain the same regardless of the value of A2 and these 
constant power functions increase as Ai increases. Figures 1(e) and 1(f) illustrate the case 
when c < 0. The graphs show that II RT < Yl PTT for any A2 and Y[ PTT < Tl UT for any 
A2 more than a small positive value, say Ao- The probability of type I error for all test 
functions are fairly small. The size and power of the RT is decreasing to as A2 growing larger 
(Figures 1(e) and 1(f)) suggesting the RT as the best choice for size but the worst choice for 
power. Since n PTT (2,A 2 ) > IP RT (2,A 2 ) for all A 2 , the PTT is preferred over the RT . Also, 
H PTT (2, A2) > H UT (2, A2) except for some moderate values of A2 but the difference is relatively 
small. From the examination of all the graphs in Figure 1, the PTT is suggested as the best 
choice when both probabilities of type I and type II errors are considered. 

The relation between power functions and Ai is shown in Fig 2. All power functions are 
approaching 1 as Ai grows larger regardless of the value of A2. This is because the probability 
of rejecting Hq : 6 = increases as Ai increases. When c > 0, the probability of type II error 
for the RT is the smallest, but the PTT is preferable than the UT for all values of \\. When 
c < 0, the PTT is preferable for its comparatively smaller probability of type II error than the 
other two tests. When c = 0, all tests have the same probability of type II error regardless of 
the value of Ai (refer to the equation (|6.2p for analytical result). 

Figure 3 illustrates the behavior of the power function H PTT (X\, X2) at three different 
values of nominal size C13. The graphs show that the test with smaller nominal significance 
level has greater power than that of larger significance level. The smaller nominal significance 
level however increases the probability of type I error as A2 moves away from zero. This is 
illustrated in Fig 3(b), I[ PTT (\i,2) at a.3 = 0.005, 0.05, and, 0.1 start at different values 
before growing larger and converging to 1. 

It is of advantage to study the relationship between the size of the PTT, that is, a PTT = 
H PTT (0, A2) and the nominal significance level of the PT, 0:3. One may want to know what 
suppose to be the actual level of significance of the PT that will reject the ultimate test with 
a predetermined probability, says 5 percent. Taking a\ and 02 to be equal, here 0.05, the size 
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Figure 2: Graphs of power functions as a function of Ai for selected values of A2 and ot\ = = 
«3 = a = 0.05. Dotted line, solid line and line with stars represent II (Ai, A2), H RT (Ai, A2) 
and n PTT (Ai, A2) respectively. Graphs (a) and (b) are for c > 0, (c) and (d) are for c = and 
(e) and (f) are for c < 0. 
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(a) (b) 




Figure 3: Graphs of power function U PTT (Xi, A2) for nominal sizes 03 = 0.005, 0.05 and 0.1. 
The c > and 02 = a\ = a = 0.05 for all graphs. 



of the test depends on A2- Figure 4 shows the graphs of II (0, A2) against «3 for different 
values of A2 with ol\ = «2 = 0.05 and c > 0. For smaller values of A2, as CK3 increases, the size 
of the PTT decreases and reaches its minimum at the value of 0:3 = a 3 (say), before growing 
larger and converging to a = 0.05. Let the value of 03 be a 3 when the size of the PTT is 0.05, 
the value of a 3 increases as A2 increases. As we consider larger values of A2, the size of the 
PTT decreases dramatically then slowly converges (appears as flat in the graph) to a at some 
positive value a 3 ". Table 1 gives the values of the size of the PTT at 03 for different values of 
A2, with a\ = «2 = 0.05 when c > 0. If we want to reject the ultimate test with significance 
level 0.05, the nominal significance level of the PT must be set to when A2 is 0. Then a 
larger but still acceptable nominal size 0:3 is required to achieve 5% significance level of the 
PTT as A2 is a bit larger than 0. But up to some point, we cannot sacrifice the increases in the 
probability of type I error of the PT as A2 grows much than 0. As A2 grows larger, a larger 03 
is required to obtain 5% significance level of the PTT (see Table 1). Note: Setting the nominal 
size of the PT to is meaningless because this means there is no chance that : j3 = is 
rejected. The size and power of the PTT converges to the RT when 03 approaches (refer 
equation (|6.4|) ). thus supports this result in Table 1. 

Figure 5 shows graphs of a PTT = II PTT (0,A2) for < 03 < 1 at selected values of A2, 
a\ and ai when c > 0. Equations (|6.4p and (|6.5p show that the size and power of the PTT 
is approaching the size and power of the RT as the nominal size of PT is closer to but 
is approaching the size and power of the UT as the nominal size of the PT is closer to 1. 
From equation (|6.4|) . setting the nominal significance level 03 = implies the size and power 
of the PTT is entirely contributed by the size and power of the RT and none from the UT. 
The contribution of the size and power of the UT to the size and power of the PTT is not 
substantial when the nominal size of the PT is small. From the graphs, the decreasing in the 
contribution of the size of the RT reduces the size of the PTT as 03 differs from zero. On the 
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igure 4: Graphs of size of the PTT (a PTT = II PTT (0, A2)) as 03 and A2 increasing when 
> and a\ = ct2 = 0.05 for all graphs. 
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Table 1: Size of ultimate test (a PTT ) as a function of nominal size of PT (as) at selected 
values of A2 and a\ = a 2 = a = 0.05. 



A 2 


a 3 


a PTT 


a 3 


a PTT 





0.05 


0.0479 


0.00 


0.0500 


0.1 


0.10 


0.0495 


0.05 


0.0525 


0.2 


0.20 


0.0476 


0.15 


0.0509 


0.4 


0.30 


0.0475 


0.25 


0.0515 


0.6 


0.35 


0.0493 


0.30 


0.0540 


0.8 


0.40 


0.0498 


0.35 


0.0547 


1.0 


0.45 


0.0491 


0.40 


0.0543 


2.0 


0.60 


0.0476 


0.55 


0.0508 


4.0 


0.65 


0.0500 


0.60 


0.5040 


6.0 


0.70 


0.0500 


0.65 


0.0500 


8.0 


0.75 


0.0500 






10.0 


0.75 


0.0500 







The a is the actual achievable significance level and 013 is the nominal PT significance level. 



contrary, setting the nominal size 03 = 1 causes the size of the PTT is totally contributed by 
the size of the UT (see equation (]6.5p ) . The contribution of the size of the RT is not significant 
when the nominal size of the PT is large. As the value of a 3 differs from 1, lesser contribution 
from the size of the UT imposes smaller size of the PTT. The size of the PTT decreases from 
both ends and the minimum of the size of the PTT is achieved at a particular value of a 3 . 

Further, analysis is carried out to investigate the dependence of the size of the ultimate 
test to the changes in the nominal sizes a±, a 2 and 03. From observation of Figures 5(a)-5(f), 
there is an increase in the percentage of ot PTT in [0,0.1] for 0:3 in [0,0.2] when we set smaller 
nominal size of a 2 for a bit larger value of A 2 . For example, there is 47.62% of n PTT (0, 1) 
in [0,0.10] for 03 in [0,0.2] at nominal size a 2 = 0.05 (see Figure 5(a)) but there is 100% of 
n PTT (0, 1) when q 2 = 0.03 (see Figure 5(b)). For some moderate values of A 2 , there is an 
increment in the percentage of il i3 " rT (0, A 2 ) when we choose a smaller nominal size a 2 . But 
only small increment is observed for a larger value of A 2 . For example, there is no H PTT (Q, 3) 
in [0,0.10] for 03 in [0,0.2] when we set the nominal size to be a 2 = 0.05 (see Figure 5(a)) but 
there is a slightly 4.76% of n PTT (0, 3) when a 2 = 0.03 (see Figure 5(b)). The small increment 
suggests setting a much smaller value of nominal size a 2 maybe necessary to achieve a small 
size of PTT with small nominal size of pre-test for moderate values of the slope. However, this 
rule fails for a large value of A 2 . 

We wish to have small size of the PTT by setting small nominal sizes of ai,a 2 and a 3 ■ 
Figure 5 shows that this could not be achieved when A 2 is large and a 3 is very small (close 
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Figure 5: Graphs of the size of ultimate test for increasing 03 selected at different values of 
nominal sizes of a± and ai with c > 0. The intersection with the vertical line represents the 
minimum. 
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Table 2: Size of ultimate test (a PTT ) as a function of nominal size of pre-test (03) at selected 
values of 0:2 and A2 with a\ = 0.05. 



A 2 = 0.5 


A 2 = 1 


A 2 = 3 


A 2 = 6 


«2 


03 


a PTT 


«2 




a PTT 




a 3 


a PTT 




03 


a PTT 


0.03 


0.03 


0.0498 


0.01 


0.00 


0.0355 


0.01 


0.08 


0.0994 


0.03 


0.04 


0.0983 


0.02 


0.0507 


0.01 


0.0343 


0.07 


0.1052 


0.03 


0.1145 


0.04 


0.19 


0.0499 


0.02 


0.00 


0.0623 


0.02 


0.15 


0.0987 


0.04 


0.05 


0.0891 


0.18 


0.0508 


0.01 


0.0605 


0.14 


0.1029 


0.04 


0.1002 


0.05 


0.31 


0.0497 


0.03 


0.00 


0.0870 


0.03 


0.20 


0.0965 


0.05 


0.05 


0.0901 


0.30 


0.0507 


0.01 


0.0839 


0.19 


0.1005 


0.04 


0.1014 


0.06 


0.34 


0.0508 


0.04 


0.02 


0.1026 


0.04 


0.22 


0.1007 


0.06 


0.04 


0.1024 


0.04 


0.0499 


0.03 


0.0999 


0.23 


0.0972 


0.05 


0.0909 


0.07 


0.48 


0.0491 


0.05 


0.11 


0.0993 


0.05 


0.25 


0.0987 


0.10 


0.05 


0.0093 


0.47 


0.0500 


0.10 


0.1015 


0.24 


0.1021 


0.04 


0.1045 



The a is the actual achievable significance level and 0:3 is the nominal pre-test significance level. 



to zero) even if we set a very small value of «2- For instance, there is less than 100% (i.e. 
80.95%) of n PTT (0,6) in [0,0.10] as a 3 in [0,0.2] (see Figures 5(a) and 5(b)) for both nominal 
sizes «2 = 0.03 and 02 = 0.05. The percentage does not reach 100% even < 02 < 0.03 is 
chosen. 

Since a PTT behaves like a RT when the nominal size 03 is small, the null hypothesis Hq : 
9 = is rejected more often for small nominal size of 03 when A2 is large because the nominal 
size c*2 is smaller than the actual size of the RT. The null hypothesis H£ : 9 = should not 
be rejected if the true value of 9 = 0. In this case, however the possibility of rejection is large 
when A2 differs much from because (5 = is assumed in the test statistic T PT . This fact 
answers the reason why very small 03 (close to zero) has a very large size of the test when A2 
is large. 

Table 2 shows the size of ultimate test as a function of nominal size 03 for selected values 
of A2 and «2 with ol\ = 0.05 and c > 0. The nominal sizes for the RT and PT are given in 
the table for the size of ultimate test near point 0.05 when A2 = 0.5 and 1 and near point 0.10 
when A2 = 3 and 6. The table enables us to observe the changes in the values of the nominal 
size of PT (03) as the nominal size «2 changes and the significance level of the PTT is around 
the same value. We wish to have small nominal size of the PT that allow us to get 5 or 10% of 
significance level of the PTT. From the table, this is achieved by selecting smaller nominal size 
of the RT for moderate and small values of slope. When A2 = 3 (moderate value), selecting 
nominal size 02 as small as 0.01 we have as much as 8% of nominal size of the PT to get below 
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than 10% significance level of ultimate test (see Table 2, row:l, col:7-9). In column 1-3 of the 
table, for A2 = 0.5 (small), approximately 5% level of significance of ultimate test is obtained 
by setting nominal size of the RT = 0.05 and nominal size of the PT = 0.3 or by setting both 
nominal sizes of the RT and PT = 0.03 but the latter with smaller nominal sizes of the PT 
and RT is more preferable. For larger value of the slope, as the nominal size of the PT closer 
to 0, the size of the PTT is growing too large. When A2 = 6 (large), to obtain at most 10% of 
significance level of ultimate test, the least nominal size for the PT that we should set is 5% 
(see Table 2, row:3, col:10-12) when the nominal size «2 is set from 0.05 to 0.10. 

8 Concluding Remarks 

The M-test of the UT, RT and PTT for testing the intercept are provided in this paper. The 
asymptotic power functions of the tests are derived by using the results from the asymptotic 
sampling distribution of the statistics. 

In the estimation regime, it is well known that the RE has the smallest MSE if distance 
parameter (a function of (3 — /3q) is or close to 0, but its MSE is unbounded for larger values 
of the distance parameter. The UE has a constant MSE that does not depend on the distance 
parameter. The PTE has smaller MSE than that of the RE for moderate and larger values 
of the distance parameter. The PTE has smaller MSE than the UE if the value of distance 
parameter is close or equal to 0. In the testing context, the power functions of the UT, RT 
and PTT demonstrate a similar behavior as the MSE of the UE, RE and PTE. 

For a set of realistic values of the regressor, with mean value larger than 0, the size of the 
RT is small when (3 = or close to 0, but the size grows large and converges to 1 for larger 
values of the slope. The UT has a constant size regardless of the value of the slope (via A2). 
The PTT has smaller size than that of the RT when the slope is and very close to 0, and 
significantly smaller than that of the the RT for moderate and large values of the slope. The 
PTT has smaller size than the UT for the value of slope is or very close to 0. 

Again for a set of realistic values of the regressor, with mean larger than 0, the RT is the 
best choice for having largest power but the worst choice for having largest size. The size of 
the UT is constant regardless of the value of the slope. The UT is the best choice for having 
smallest size but the worst choice for having smallest power. The PTT has smaller size than 
the RT for moderate and larger values of the slope and has larger power than the UT for 
smaller and moderate values of the slope. Therefore, the power function of the PTT is found 
to behave similar to the MSE of the PTE in the sense that though it is not uniformly the best 
statistical test with the smallest size and the largest power but it protects from the risk of a 
too large size and a too small power. Thus, the power function of the PTT is a compromise 
between that of the UT and RT. In the face of uncertainty on the value of the slope, if the 
objective of a researcher is to minimize the size and maximize the power of the test, the PTT 
is the best choice. 
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The tables and graphs support the analytical asymptotic comparison of the UT, RT and 
PTT as discussed in Section 6. The analysis is furthered by investigating the relationship 
between the power functions and its arguments, namely the slope and the nominal sizes, of 
the UT, RT and PT. The chosen values of the nominal sizes that are set before testing affect 
the actual size of the PTT. 

In order to get small probability of type I error for the ultimate test, our investigations 
concentrate on small nominal sizes of the UT, RT and PT with a view to achieving small 
(actual) significance level of the PTT. The study revealed that for small and moderate values 
of slope, the smaller the nominal size of the RT, the smaller the size of the PTT when other 
nominal sizes are kept fixed and small. For moderate and large values of the slope, a large size 
of the PTT is observed when nominal size of PT is set close to 0. The size of the PTT behaves 
much like that of the RT when the nominal size of PT is small, but it behaves more like that 
of the UT when the nominal size of the PT is large. 

The power of the ultimate test is larger for moderate values of the slope than for smaller 
and larger values of the slope. It is shown analytically that the power of the PTT approaches 
the power of the RT when the nominal size of PT is closer to but approaches the power of 
the UT when the nominal size of the PT is closer to 1. In practical applications, size of the 
PT should be small (ideally close to 0), and in such cases the power of the PTT is close to 
that of the RT (which is much higher than that of the UT). To avoid the larger size of the RT, 
practitioners are recommended to use the PTT as it achieves smaller size (than the RT) and 
higher power (than the UT) when the value of the slope is small or moderate. Even for large 
values of the slope the PTT has at least as much power as the UT. 
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